MFMBRANE-ASSOCTATED IMMUNOGENS OF MYCOBACTERIA 
Technical Field of the Invention 

The invention relates to membrane-associated 
polypeptides of mycobacteria and, in particular, the 
use of such polypeptides and the nucleic acids encoding 
them for use as vaccines and diagnostic reagents. 

Background of the Invention 

The mycobacteria are a diverse collection of acid fast, 
gram-positive bacteria, some of which cause important 
human and animal diseases. In humans, the two most 
common mycobacteria-caused diseases are tuberculosis 
(TB) and leprosy, which result from infection with M. 
tuberculos'is and M. leprae , respectively. 

Tuberculosis displays all of the principal 
characteristics of a global epidemic disease. 
Currently, tuberculosis afflicts more than 35 million 
individuals worldwide and results in over 4 million 
deaths annually. In India, at any given time, almost 
8 million people are reported to suffer from this 
disease and 500,000 deaths recorded. These figures may 
not cover the totality of those suffering from this 
disease in this country. Thus, tuberculosis appears to 
be a problem of major concern in India as also in many 
other countries of the world. 
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Tuberculosis is caused by M. tuberculosis , Jl. bovis, J4. 
africanum and M. microti , the acid-fast. Gram positive, 
tubercle bacilli of the family Mycobacteriaceae. Some 
local pathogenic strains of M. t ube r cu 1 o s i s have also 
5 been isolated from patients in Madras and other cities 
in India, which differ in some respects from 
tuberculosis H37Rv, which is a virulent strain. 

In recent years, certain groups of individuals with 
AIDS have been found to have a markedly increased 

10 incidence of TB as well. It has now been shown that 
one group of mycobacteria which consists of U. avium , 
M. intracel lul are and M. scrofulaceum , jointly known as 
MAIS complex, is responsible for disseminated disease 
in a large number of persons with AIDS (Kiehn et al. , 

15 J. Clin, Microbiol . , 21:168-173 (1985); Wong et al. , 
Amer. J. Med. , 78:35-40 (1985)). 

Since Koch identified M. tuberculosis as the causative 
agent of tuberculosis in 1882, many scientific studies 
and public health efforts have been directed at 

2 0 diagnosis, treatment and control of this disease. 

However, characteristics of M. tuberculosis have 
hampered research to improve diagnosis and to develop 
more effective vaccines. In addition, the biochemical 
composition of the organism has made identification and 
25 purification of the cellular constituents difficult, 
and many of these materials once purified, lack 
sensitivity and specificity as diagnostic reagents. As 
a result, diagnostic and immunoprophy lactic measures 
for mycobacterial diseases have changed little in the 

3 0 past half century • The conventional methods for the 

diagnosis of M. tuberculosis are troublesome and 
results are delayed. 

Bacillus Calmette-Guerin (BCG) , an avirulent strain of 
M. bovis (Calmette, A., Masson et Cie , Paris (1936)), 
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is used extensively as a vaccine against tuberculosis* 
Though numerous studies have found that it has 
protective efficacy against tuberculosis (Luelmo, F. , 
Am, Rev. Respir. Pis. . 125, 70-72 (1982)) BCG has 
5 failed to protect against tuberculosis in several 
trials (WHO, Tech. Rep, Ser. , 651:1-15 (1980)) for 
reasons that are not entirely clear (Fine, P. , 
Tubercle , 65:137-153 (1984); Fine, et al., Lancet , 
(ii) :499-502 (1986) ) . 

10 The eradication with vaccination, early diagnosis, and 
efficient therapy is an important objective of the 
drive to combat mycobacterioses. The lacunae in the 
present knowledge of the biology of these pathogens - 
their make-up, their natural history, their physiology, 

15 biochemistry and immunological reactivities, highlights 
the need for attempts to unravel their weaknesses, so 
that more efficient ways to combat this disease can be 
devised. To develop more effective tools for the 
diagnosis and prevention of these diseases, it is 

20 important . to understand the immune response to 
infection by mycobacterial pathogens. The 
mycobacterial components that are important in 
eliciting the cellular immune response are not yet well 
defined. The antibody and T-cell responses to 

25 infection or inoculation with killed mycobacteria have 
been studied in humans and in animals. Human patients 
with TB or leprosy produce serum antibodies directed 
against mycobacterial antigens. Although antibodies 
may have some function in the antimycobacterial immune 

30 response, the exact function remains to be clarified 
since no protective role can be ascribed to these 
antibodies. Protection against mycobacterial diseases 
involves cell-mediated immunity. 

Mycobacteria do not produce any directly toxic 
3 5 substances and consequently their pathogenicity results 
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from multiple factors involved in their interaction 
with the infected host. Intracellular parasitism 
probably depends on host cell trophic factors; it is 
conceivable that their short supply may be 
5 bacteriostatic and could play a role in the mechanism 
of mycobacterial dormancy. 

It is generally understood that protective immunity in 
mycobacterial infection is mediated by specific T cells 
which activate macrophages into non-specific 

10 tuberculocidal activity. Evidence suggests that gamma- 
I FN triggers macrophages towards H 2 0 2 -mediated 
bacterial killing, but related or other macrophage 
activating factor (MAF) molecules may also be involved. 
The causes responsible for the inadequate bactericidal 

15 function at sites of abundant T cell proliferation have 
not yet been explained. Dissociation between delayed- 
type hypersensitivity (DTH) and protective immunity led 
to views that T-cells of a distinct subset or 
specificity could be responsible for the acquired 

20 resistance to mycobacterial infection. Alternatively, 
interference with protection may result from corollary 
cellular reactions, namely by suppressor T-cells and 
macrophages, or from the shifting of T-cells towards 
helper function for B-cells. 

25 Unlike viral and some parasite pathogens which can 
evade host resistance by antigenic shift, mycobacteria 
have a resilient cell wall structure and can suppress 
host immune responses by the action of their 
immunomodulatory cell wall constituents. Whilst the 

3 0 success of protective immunization towards other 
microbial pathogens mainly depends on quantitative 
parameters of immunity, it appears that mycobacterial 
immunomodulatory stimuli produce a regulatory 
dysfunction of the host immune system. This may not be 

35 possible to override simply by more resolute 
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immunization using vaccines of complex composition such 
as whole mycobacteria (e.g. BCG) . Perhaps mycobacteria 
did not evolve potent "adjuvant" structures to boost 
the host immunity but rather to subvert host defenses 
5 towards ineffective cellular reactions operating to the 
advantage of the pathogen. Vaccination with an 
attenuated pathogen such as BCG could amplify further 
immune responses but with limited protection of the 
host, the potential scope for immunization with defined 
10 antigens is yet to be explored. 

The purification and characterization of individual 
antigenic proteins are essential in understanding the 
fundamental mechanism of the DTH reaction on the 
molecular level. The possible functional role of 

15 proteins of defined structure in the pathogenesis of 
mycobacterial diseases as well as for diagnostic 
purposes remains of great interest. Numerous groups 
have attempted to define mycobacterial antigens by 
standard biochemical and immunological techniques, and 

20 common as well as species specific antigens have been 
reported in mycobacteria (Minden, et al. , Infect. 
Immun. , 46:519-525 (1984); Closs, et al., Scand. J. 
Immunol . , \2 : 249-263 .( 1980) ; Chaparas, etal., Am. Rev. 
Respir. Pis, . 122:533 (1980); Daniel, . et al., 

25 Microbiol. Rev. . 42:84-113 (1978); Stanford, et al., 
Tubercle . 55:143-152 (1974); Kuwabara, S., J. Biol. 
Chem. . 250:2556-2562 (1975)). 

Very little information about the mycobacterial genome 
is available. Initially, basic studies were conducted 

3 0 to estimate the genome size, G+C content and the degree 
of DNA homology between the various mycobacterial 
genomes (Grosskinsky , et al., Infect. Immun. . 57, 
5:1535-1541 (1989); Garcia, et al. , J. Gen. 
Microbiol. . 132:2265-2269 (1986); Imaeda, T. , Int. J. 

35 Svs. Bacterid. , 35, 2:147-150 (1985); Clark-Curtiss , 
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etal., J. Bacteriol . . 161 3:1093-1102 (1985); Baess, 
I. et al., B. , Acta, Path. Microbiol. Scand. , (1978) 
86:309-312; Bradley, S. G., Am. Rev. Respir. Pis. P 
106:122-124 (1972)). Recently, recombinant DNA 

5 techniques have been used for the cloning and 
expression of mycobacterial genes. Genomic DNA 
fragments of M. tuberculosis . M. leprae and some other' 
mycobacterial species were used for the construction of 
lambda gtll phage (Young, et al., Proc. Natl. Acad. 

10 Sci. . U.S.A., 82:2583-2587 (1985); Young, et al.. 
Nature (London), 316:450-452 (1985)) or other vector- 
based recombinant gene libraries. These libraries were 
screened with murine monoclonal antibodies (Engers, et 
al., Infect. Immun. , 48:603-605 (1985); Engers, etal., 

15 Infect. Immun. , 51:718-720 (1986)) as well as 
polyclonal antisera and some immunodominant antigens 
were identified. The principal antigen among these 
being five 12, 14, 19, 65 & 71 kDa of M. tuberculosis 
(Young et al., Proc. Natl. Acad. Sci. , U.S.A., 82:2583- 

20 2587 (1985); Shinnick et al., Infect. Immun. , 
55 (7) : 1718-1721 (1987); Husson and Young, Proc. Natl. 
Sc. Acad. . 84:1679-1683 (1987); and five 12, 18, 23, 36 
& 65 kDa antigens of M. leprae (Young, et al., Nature 
(London), 316:450-452 (1985)). A few homologues of 

25 some of these antigens were also identified in some 
other mycobacterial species (e.g., BCG) (Yamaguchi et 
al., FEB 06511 , 240:115-117 (1988); Yamaguchi et al., 
Infect. Immun. , 57:283-288 (1989); Matsuo, et al., J. 
Bacteriol. . 170, 9:3847- 3854 (1988); Radford, et al*, 

30 Infect. Immun. . 56, 4:921-925 (1988); Lu, et al., 
Infect. Immun. . 55, 10:2378-2382 (1987); Minden, et 
al., Infect. Immun. , 53, 3:560-564 (1986); Harboe, et 
al., Infect. Immun. . 52, 1:293-302 (1986); Thole, et 
al., Infect. Immun. . 50, 3:800-806 (1985)). These 

35 antigens, however, are either intracellular or secreted 
molecules. 



Although M. bovis BCG has been widely used as a vaccine 
against tuberculosis, the determination of the 
membrane-associated polypeptides of mycobacterium that 
are capable of inducing a protective immune response is 
highly desirable. The use of such a membrane- 
associated polypeptide or the DNA encoding it provides 
for the generation of recombinant vaccines, e.g. , 
mycobacterial membrane-associated immunogens expressed 
in, for example, a virus or bacterium such as vaccinia 
virus, Salmonella, etc. used as a live carrier, or the 
display of non-mycobacterial immunogens on the surface 
of a cultivable mycobacterial strain which can be used 
as a live recombinant vaccine. 

Accordingly, it is an object herein to provide methods 
for identifying and isolating nucleic acids encoding a 
membrane-associated polypeptide of mycobacteria. 

Further, it is an object herein to provide membrane- 
associated polypeptides of mycobacteria and the nucleic 
acids encoding it. 

sti11 further, it is an object herein to provide 
vaccines utilizing all or part of the membrane- 
associated polypeptide of a mycobacterium or the DNA 
encoding such membrane-associated polypeptide. 

Still further, it is an object to provide reagents 
comprising said membrane-associated polypeptide with a 
mycobacterium or DNA encoding it useful in diagnostic 
assays for mycobacterial infection. 

Still further, it is an object to provide a promoter 
sequence comprising the promoter of said membrane 
associated polypeptide, which can direct gene 
expression in mycobacteria as well as in other 
microorganisms such as E. coli . 
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Summary of the Invention 

In accordance with the foregoing objects, the invention 
includes compositions comprising nucleic acid encoding 
all or part of a membrane-associated polypeptide of a 
5 mycobacterium and the membrane-associated polypeptide 
encoded by said DNA. The membrane-associated 

polypeptide is characterized by the ability to detect 
an immune response to pathogenic mycobacteria or the 
mycobacteria from which the membrane associated 
10 polypeptide or part thereof is derived. Such 
mycobacteria include M. bovis , M. tuberculosis . M. 
leprae . M. af ricanum and M. microti . M. avium . M. 
intracellular and M. scrof ulaceum and M. bovis BCG . 

A particular mycobacterial membrane-associated 
15 polypeptide is a 79 kD ion-motive ATPase. Extra- 
cellular, intra-cellular and transmembrane domains are 
identified in this mycobacterial membrane-associated 
polypeptide based upon its DNA and deduced amino acid 
sequence . 

20 The invention also includes vaccines utilizing all or 
part of a membrane-associated mycobacterial 
polypeptide or an expressible form of a nucleic acid 
encoding it. The invention also includes 

mycrobacterial promoter sequences capable of directing 

25 gene expression in mycobacteria as well as in other 
microorganisms such as E. coli . Such promoters are 
from mycobacterial genes encoding membrane-associated 
ATPases. A preferred promoter is that of the gene 
encoding the M. bovis BCG 79 kD membrane-associated 

3 0 polypeptide. This promoter sequence is especially 
useful to express genes of interest in mycobacteria. 
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Brief Description of the Drawings 

Figure 1 illustrates the results of immunoscreening of 
recombinant colonies carrying M. bovis BCG DNA (panel 
A) and M. tuberculosis H37Rv DNA (panel B) , using sera 
5 from TB patients in which the presence of M. bovis BCG 
antigens and M . tuberculosis H37Rv antigens capable of 
reacting with the /antisera^) is indicated by a 
qualitative signal* ^ 

Figure 2 shows the comparison of restriction site maps 
10 of recom binan t clones carrying BCG DNA identified using 
the immunoscreening assay^cTescribed herein (panel B) 
with trfe— restr iction s ite—maps of five immunodominant 
antigens of M. tuberculosis and M. bovis BCG genomic 
DNAs, respectively, (Husson and Young, Proc. Natl. 
15 Acad. Sci. , U.S.A., 84:1679-1683 (1987); Shinnick et 
al., Infect. Immun. . 55:1718-1721 (1987) (panel A)). 
Restriction maps in each panel have been drawn to the 
same scale (indicated at the top) , and restriction 
sites are indicated above the restriction maps. The 
2 0 dotted line in panel A represents the noh-mycobacterial 
DNA. Restriction enzymes: B, BamHI, E, EcoRI, G> 
Bglll, K, Kpnl, P, Pvul, X, Xhol , H, Hindi, U, PvuII, 
Ps, PstI, Hi, Hindlll. In panel A, A is Sail and S is 
Sacl. In panel B, S is Sail. 



25 Figure 3 illustrates the results of Western blot 
analysis of the sonicated supernate of recombinant 
clone pMBBSIA which carries a BCG DNA insert identified 
following immunoscreening of the recombinant colonies • 
The top panel shows reactivity of MBB51A (lane 2) and 

3 0 E. coli (lane 1) with seraifr om TB patients. The 
bottom panel (part A) shows reactivity of MBB51A (lanes 
1 and 2) and E. coli (lane 3) with anti-H37Rv sera 
raised in rabbits. Part B shows reactivity of MBB51A 
(lanes 1 and 2) and E. coli (lane 3) with the second 




-10- 



antibody alone. Arrows indicate the position of the 90 
kD itcununoreactive BCG protein expressed by the 
recombinant MBB51A, which was absent in the negative 
control . 

5 Figure 4 illustrates the nucleotide sequence (Seq. ID 
No.: lj of clone pMBBSIA 3.25 kb insert DNA containing 
the M. bovis BCG immunoreactive MBB51A gene encoding an 
ion-motive ATPase, with a deduced molecular weight of 
79 kD. The deduced amino acid sequence (Seg. ID 
10 No.: 2) is shown below the nucleotide sequence. 
Upstream promoter elements are underlined. 
Transcription termination region is indicated by 
inverted arrows. 5' and 3' flanking regions are also 
shown . 

15 Figure 5 illustrates a schematic model derived for the 
7 9 kD protein encoded by pMBBSIA which represents an 
ion-motive ATPase of BCG. The model considers only the 
structural and functional features that are prominent 
in the other ion-motive ATPase homologs of 

20 transmembrane domains of the protein. Functionally, 
important amino acid residues are indicated (P) , 
proline at; position 400; (D) , aspartic acid at position 
443; (G) , glycine at position 521; and (A), -alanine at 
position 646. Numbers indicate amino acid residues 

2 5 broadly defining the limits of the transmembrane 
domains. 

Figure 6 illustrates the results of Southern blot 
hybridization of BamHI digest of genomic DNAs from 
bovis BCG (lane 6) , |4. tuberculosis H37Rv (lane 5) , H. 



30 smeamatis (lane 4) and M. vaccae (lane 3jusing pMMBSIA 
DNA insert (lane 8) as probe. Panel A**6hows ethidium 
bromide stained gel and panel B shows the results of 
Southern blot hybridization. 
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Detailed Description of the Invention 

As used herein, a "membrane-associated polypeptide" of 
a mycobacterium is defined as any Mycobacterial 
membrane-associated polypeptide which is capable of 
5 detecting an immune response against the wild-type 
mycobacterium containing the membrane-associated 
polypeptide. However, based upon the observed cross- 
reactivity of the 79 kD membrane-associated polypeptide 
of an Jl. bovis BCG with pooled anti-sera from patients 

10 afflicted with tuberculosis and the cross-hybridization 
as between the DNA encoding the 79 kD membrane- 
associated polypeptide and the DNA of M. tuberculosis 
H37Rv, the membrane-associated polypeptide of the 
invention is not limited to that identified herein from 

15 M. bovis BCG. Rather, it encompasses not only homologs 
to the 79 kD ion-motive ATPase but also any and all 
membrane-associated polypeptides of a mycobacterium 
that can be used to detect an immune response by the 
same or a different mycobacteria in which the membrane- 

20 associated polypeptide is normally found. 

As used herein, "nucleic acid" includes DNA or RNA as 
well as modified nucleic acid wherein a detectable 
label has been incorporated or wherein various 
modifications have been made to enhance stability, 

25 e.g., incorporation of phosphorothioate linkages in the 
phosphoribose backbone, etc. Such nucleic acid also 
includes sequences encoding the anti-sense sequence of 
the DNA encoding the membrane-associated polypeptide 
such that the now well-known anti-sense technology can 

3 0 be used to modulate expression of such membrane- 
associated polypeptides. 

In some aspects of the invention, the nucleic acid 
sequence encoding all or part of a membrane-associated 
polypeptide of the mycobacterium is used as a vaccine. 
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When so-used the nucleic acid is generally an 
"expressible nucleic acid" that contains all necessary 
expression regulation sequences to control 
transcription and translation of the nucleic acid in a 
5 designated host system. In some vaccine embodiments, 
the DNA encodes a chimeric polypeptide containing at 
least one transmembrane domain of the membrane- 
associated polypeptide and an "immunogenic 
polypeptide". The transmembrane domain is used to 

10 display the immunogenic polypeptide on the surface of 
a particular host organism such as an attenuated live 
vaccine. When the membrane-associated polypeptide 
includes more than one transmembrane region, one or 
more of the transmembrane regions can be used with an 

15 immunogenic polypeptide. Thus, for example, the 79 kD 
ion-motive ATPase as shown in Figure 5 has at least 
three extracellular domains into which an immunogenic 
polypeptide can be engineered by well-known methods 
involving recombinant DNA technology. Although it is 

20 preferred that more than one transmembrane region be 
used to display an immunogenic polypeptide, one skilled 
in the art can readily vary the length of such a 
membrane-associated polypeptide to maximize an 
immunogenic response or to minimize the amount of 

2 5 membrane-associated polypeptide used - in such 

applications . 

As used herein, "immunogenic polypeptide" comprises all 
or part of any polypeptide which can potentially be 
utilized in a vaccine or diagnostic application. Thus, 

3 0 the immunogenic polypeptide can comprise heterologous 

immunogens, i.e., immunogens from non-mycobacterial 
sources, e.g. , Salmonella or Shigella or from different 
mycobacteria from which the membrane-associated 
polypeptide is derived, e.g., immunogens from 
3 5 Mycobacterium tuberculosis fused to a membrane- 
associated polypeptide from M. bovis BCG. However, in 
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some instances homologous immunogens can be used. For 
example, each of the extracellular domains as set forth 
in Figure 5 herein can be combined and displayed by 
combination with one or more of the transmembrane 
5 domains from the membrane-associated polypeptide 
normally containing them. Alternatively, the 

intercellular domains can be displayed extracellularly 
using appropriate transmembrane regions from the same 
molecule, 

10 In an alternate vaccine embodiment, all or^part of the 
^ membrane-associated polypeptide of m^bact^ja-, rather 
than the DNA encoding, is used as^part of a vaccine. 
Such proteinaceous vaccines are formulated with well- 
known adjuvants and administered following well- 
15 established protocols known to those skilled in the 
art. 

In still other embodiments, the nucleic acid encoding 
the membrane-associated polypeptide of the invention 
can be used as a diagnostic for detecting infection 

20 based upon hybridization with wild-type genes contained 
by the infectious mycobacterium. Such detection can 
comprise direct hybridization of DNA extracted from an 
appropriate diagnostic sample or PCR amplification 
using the nucleotide sequence of the nucleic acid 

25 encoding the membrane-associated polypeptide of the 
invention to prime amplification. If PCR amplification 
is primed in a conserved region the presence of 
mycobacteria in a diagnostic sample can be determined. 
If primed in ; a non-conserved region which is species 

30 specific the diagnostic assay determined the specific 
mycobacterium causing an infection. 



In addition, the membrane-associated polypeptide of the 
invention can also be used to detect the presence of 
antibodies in the sera of patients potentially infected 
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with mycobacteria. Such detection systems include 
radioimmunoassays and various modifications thereof 
which are well-know to those skilled in the art. In 
addition, the membrane-associated polypeptide of the 
5 invention can be used to detect the presence of a cell- 
mediated immune response in a biological sample. Such 
assay systems are also well-known to those skilled in 
the art and generally involve the clonal expansion of 
a sub-population of T cells responding to stimuli from 
10 the membrane-associated polypeptide. When so-used, the 
humoral and/or cell-mediated response of a patient can 
be determined and monitored over the course of the 
disease. 



Recombinant clones encoding immunogenic protein 

15 antigens of M . bovis BCG have been isolated from a 
genomic library of M. bovis BCG DNA. In particular, 
DNA fragments encoding four protein antigens of M. 
bovis BCG have been isolated by probing a pBR322 
library of M. bovis BCG DNA with sera from TB patients, 

20 absorbed on E. coli . Restriction site maps of these 
four recombinant clones are different from those of the 
five immunodominant antigens of mycobacteria (Young, et 
al., Proc 1 Natl. Acad. Sci. , U.S.A., 82:2583-2587 
(1987) ; Husson and Young, Proc. Natl. Acad. Sci. . 

25 U.S.A., 84:1679-1683 (1987); Shinnick et al., Infect. 
Immun . , 55:1718-1721 (1987)), thereby indicating that 
these cloned protein antigens are novel. One of the 
recombinant DNA clones encoded an immunoreactive 
protein with apparent molecular weight of 90 kD as 

3 0 determined by Western blot analysis. The complete 
nucleotide sequence of the insert DNA of this clone was 
determined. This clone was found to carry a 
mycobacterial promoter and a monocistronic ORF encoding 
a protein of 761 amino acids with a deduced molecular 

35 weight of 79 kD. This 79 kD protein had extensive 
homology with ion-motive ATPases of S. f aecalis (Solioz 
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et al., J. Biol, chem . 262:7358-7362 (1987)), E . coli 
(Hesse et al., Proc. Natl. Acad. Sci. , U.S.A. , 81:4746- 
4750 (1984)) and several other organisms, and thus, 
represents an ion-motive ATPase or a putative K+ATPase 
5 of BCG. Using computer algorithms, this ion-motive 
ATPase was determined to be a membrane protein and has 
a homologue in M. tuberculosis H37Rv, which is 
pathogenic in humans, but not in M. vaccae and 1$.' 
smegmatis, which are non-pathogenic. As a result, 

10 novel BCG immunogens can be available which can be 
useful in the prevention, diagnosis and treatment of 
tuberculosis and other mycobacterial infections. They 
can be used, for example, in the development of highly 
specific serological tests for screening patients for 

15 individuals producing antibodies to M. tuberculosis , or 
those infected with M. tuberculosis , in the development 
of vaccines against the disease, and in the assessment 
of the efficacy of the treatment of infected 
individuals. 

2 0 Further, based on the nucleotide sequence of the 

pMBB51A insert DNA, appropriate oligonucleotide primers 
can be used for PCR amplification using as template M. 
bovis BCG or M . tuberculosis H3 7Rv DNA . Such a PCR 
amplification scheme can be thus useful for the 
25 detection of mycobacterial DNA in a given sample. 
Further, by a judicious choice of the primer design, 
such an amplification procedure can be adapted for 
taxonomic classification of mycobacterial DNAs. For 
example, using primers to flank a heavily conserved 

3 0 region such as the ATP-binding site, PCR amplification 

is common to all mycobacterial species, whereas using 
primers from non-conserved areas, amplification can be 
made species specific. 
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Example I 



5 



Isolation and Characterization of Genes 
Encoding Immogenic Protein Antigens 

of Mycobacterium bovis BCG 
and Mycobacterium tuberculosis H37R 



A. Construction of Recombinant DNA 

Libraries of M. bovis BCG DNA and 
Mycobacterium Tuberculosis H37Rv 

A recombinant DNA library of M. bovis BCG genomic DNA 
10 was constructed using pBR322 a high copy number plasmid 
vector (Bolivar, et al., Gene , 2:95-113 (1977)) with 
antibiotic markers (ampicillin and tetracycline) and 
several unique cloning sites, M. bovis BCG cells were 
harvested from a culture in late logarithmic phase of 
15 growth and high molecular weight DNA was isolated by 
the procedure of (Eisenach, et al., J . Mol . Biol . . 
179:125-142 (1986)) with slight modifications. BCG DNA 
was digested to completion with BamH I and shotgun 
cloning of these fragments into the BamH I site of 
20 pBR322 was performed. The genomic library was 
transformed into E . coli strain DHI and recombinants 
were scored on the basis of ampicillin resistance and 
tetracycline sensitivity. The aim of this approach 
was to generate restriction fragments of a broad size 

2 5 range so as not to restrict the library to DNA 

fragments of any particular size range. This cloning 
strategy also ensured to a large extent that any 
recombinants selected for expression of mycobacterial 
antigens should be likely to drive expression from a 

3 0 mycobacterial promoter rather than the Tet promoter of 

pBR322. 

The BCG library constructed in this manner contained 
2051 clones of BCG origin. In an analogous manner, a 
genomic library of Mycobacterium tuberculosis H37Rv DNA 
35 was constructed and 1100 clones obtained. 
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Th e BCG DNA inserts ranged in size from 0-9 to 9.5 kb. 
The average size of the mycobacteria DNA fragments 
inserted in pBR322 was estimated to be about 4 kb. 
Given the genome size of BCG to be 4.5 x 10 3 kb 
5 (Bradley, S. G. , J. Bacterid. , 113:645-651 (1973); 
Imaeda, et al., Int. J. Syst. Bacterid. , 32, 456-458 
(1982)), about 1000 clones of this average insert size 
would represent comprehensively the entire genome of' 
the microorganism. 



10 B. Isolation of Recombinant DNA Clones Encoding 
BCG Mycobacterium bovis BCG and Mycobacterium 
tuberculosis H37Rv Protein Antigens 

In order to identify recombinants expressing 
mycobacterial antigens, a colony immunoscreening assay 

15 (CIA) to screen recombinant colonies with appropriate 
antisera, was established. Sera obtained from 20 
patients newly diagnosed with active pulmonary 
tuberculosis were pooled for use in immunoscreening. 
None of the patients had received treatment for 

2 0 tuberculosis prior to this study and their sputa were 
positive for acid fast bacteria in all cases. Pooled 
sera were absorbed on a E. coli sonicate overnight at 
4°C, to eliminate antibodies cross-reactive to E. coli 
antigens, thereby improving signal to noise ratio 

25 during the immunoscreening. 



Individual recombinant colonies were grown overnight on 
nitrocellulose membranes and immunoscreening was 
carried out as described with slight modifications. 
The colonies were lysed in chloroform vapor to release 
3 0 the cloned mycobacterial antigens, immobilized on the 
nitrocellulose paper. The immobilized antigens were 
reacted with TB sera and binding of the antibody was 
revealed by standard procedures using a horseradish 
peroxidase-protein A detection system. The signals 
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obtained with the recombinant clones were compared with 
that obtained in case of E. coli colonies harbouring 
pBR322 vector alone, which served as the negative 
control, to assess the signal to noise ratio. Further, 
5 to ascertain whether the immunoreactivity of the 
recombinant clones was due to anti-mycobacterial 
antibodies or due to a reaction with normal serum * 
components, another CIA of the selected recombinants 
was performed using TB sera and normal human sera NHS 

10 which had been absorbed on E. coli in a manner 
analogous to that described earlier for TB sera. Only 
those clones reacting selectively with TB sera and not 
with NHS, were considered to be unambiguously 
suggestive of the presence of mycobacterial antigens, 

15 The use of this immunoscreening approach to identify 
recombinant colonies carrying mycobacterial DNA inserts 
capable of expressing mycobacterial antigens is 
described below: 

Figure 1 shows the result of immunoscreening of 

20 recombinant colonies carrying M. bovis BCG DNA (panel 
A) or M. tuberculosis H37 Rv DNA (panel B) using sera 
from TB patients. The colonies were grown on 
nitrocellulose paper" overnight, lysed to release the 
cloned mycobacterial antigen and allowed to react with 

25 the antibodies. The presence of mycobacterial antigen 
is indicated by a qualitative signal in the recombinant 
clones which is absent in the negative control 
comprising colonies harbouring pBR322 vector alone. A 
J similar assay was repeated with normal human serum to 

3 0 ascertain the specificity of the cloned mycobacterial 
antigens. 51 recombinant colonies carrying M. bovis 
BCG DNA inserts and 45 recombinant colonies carrying £1. 
tuberculosis H37Rv DNA inserts were screened by the 
above procedure; 14 clones of BCG origin (panel A) and 

35 2 clones of H37Rv origin (panel B) exhibited distinct 
strong signals indicating the immunoreactivity of these 



-19- 

clones (Fig, 1) . All these clones were also tested for 
immunoreactivity with NHS • However, with the exception 
of 3 clones which showed a slight reactivity to NHS, 
none of the clones reacted with NHS, thereby indicating 
that these expressed mycobacterial antigens reacted 
selectively with TB sera. Thus, this procedure 
resulted in the forthright identification of 
recombinant clones encoding mycobacterial antigens. 
This strategy can be generally applicable to 
mycobacterial gene banks prepared in plasmid or cosmid 
vectors to identify genes which are expressed in E. 
coli at least to the limit detectable by the 
immunoassay. 

C. Restriction Mapping of Immunoreactive 

Mycobacterium bovis BCG DNA Recombinants 

The insert DNAs of four of the immunoreactive BCG 
recombinant DNA clones isolated using the TB sera were 
mapped with restriction endonucleases . Figure 2, panel 
B, shows the genomic DNA restriction site maps deduced 
for the cloned BCG DNA in four recombinants, in which, 
A represents Sal I, B, BamH I, E, EcoR I, G, Bgl II, K, 
Kpn I, P, Pvu I, S, Sac I, X, Xho I. These restriction 
site maps were then compared with those constructed 
previously for the five immunodominant antigens of 
tuberculosis/ M . bovis BCG (Young, et al., Proc. Natl. 
Acad. Sci. . U.S.A., 82:2583-2587 (1985); Husson, et 
al., Proc. Natl. Acad. Sci. , 84:1679-1683 (1987); 
Shinnick, et al . , Infect. Immun. , 55, 7:1718-1721 
(1987)) (Figure 2, panel A). Since the restriction 
site maps shown in panels A and B have been drawn to 
the same scale, the differences between the two are 
apparent. There are no regions of similarity between 
the restriction site maps of immunoreactive BCG 
recombinant clones and those of the previously 
characterized immunodominant antigens of M. 
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tabsE£ u lfisi§ / !S . b^is BCG. Therefore, one 

; hat the cloned BCG DMA inserts in the four 

recombinants are novel. 

isolation and ATPase 
„t „ r.. ne E r ™' tin- a BCG Ton-motive. ATPase 



A. 



T, Q nHfi. a ti- ^ * "°ve1 BTfl Antigen 



One of the four immunoreactive BCG clones, pMBBSIA, 
revealed the presence of a protein of Mr 90 ». on 
10 Western blot analysis using TB sera as well 

H37HV polyclonal antiserum raised in rabbits (Figure 
3) Similar Western blot analysis of pMBB51A with a 
pool of a few anti-mycobacterial monoclonal antibodies 
(TB 23, TB 71, TB 72, TB 68, TB 78; Engers et al., 
15 Infec^Bm-, 48:603-605 (1985)) or with normal human 
sera did not reveal this immunoreactive protein of 90 
kD This confirms that pMBB51A encodes a BCG antigen 
which is different from those identified previously in 
BCG, thereby making it a novel antigen. 

20 B. Determination of the 

Mncleot^ft Seque nt " f pMBBSIA 

In order to further characterize this novel BCG 
antigen, pMBB5lA DNA insert was subjected to nucleotide 
sequencing. The BamH I-BamH I insert carried in 
25 PMBB51A was mapped for additional restriction enzyme 
cleavage sites. It was determined that there were at 
a minimum a single Pst I site and 3 Sal I sites 
sequence. Overlapping fragments derived from single 
and double digests of Sal I, BamH I and Sal I, BamH I 
30 and Pst I, and Pst I and Sal I, were subcloned into 
M13m P 18 and M13m P 19 vectors, in preparation for DNA 
sequence analysis. DNA sequencing was then carried out 
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using commercially available kits such as the Sequenase 
system and the T7 system from Pharmacia. 
Oligonucleotides derived from the determined sequence 
were synthesized and used as primers to complete the 
5 sequence of the larger inserts. Several areas of 
compression were encountered during the sequencing and 
these were resolved by using dITP in the sequencing 
reactions, and by changing the reaction conditions. 
The complete nucleotide sequence of the pMBBSIA insert 
10 DNA was determined by sequencing both the strands using 
dGTP as well as dITP. The DNA sequence of the pMBBSIA 
insert was determined to be 3.25 kb long with a GC 
content of 67.1% and is shown in Figure 4. 

The determination of the DNA sequence of the 3.25 kb 
15 insert of clone pMBBSIA (Figure 4) permitted the 
elucidation of the amino acid sequence of the 90 kD BCG 
antigen. In Figure 4, nucleotides are numbered from 
the left end of the pMBBSIA insert DNA. 

A search of pMBBSIA insert DNA sequence for possible 
20 ORFs in all three reading frames revealed the longest 
ORF of 2286 bp encoding a polypeptide of 761 amino 
acids on one of the strands. The other strand was 
found to have a smaller URF of 1047 bp capable of 
encoding a polypeptide of 349 amino acids. The longest 
25 ORF encoding a 761 amino acid long protein corresponded 
to a deduced molecular weight of 79 kD which came 
closest to the immunoreactive BCG protein with apparent 
molecular weight of 90 kD, seen on the Western blot. 
The deduced amino acid sequence for this protein is 
30 given below the nucleotide sequence in Figure 4, 

The location of this ORF on the pMBBSIA insert DNA was 
such that there were long stretches of flanking DNA 
sequences, devoid of any meaningful ORFs, present on 
either side. This precluded the expression of this ORF 
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from the pBR3 2 2 Tet gene promoter and instead suggested 
that this ORF was being expressed from its own promoter 
in pMBBSlA. This also suggested that E. coli may 
correctly utilize the M. bovis BCG transcription and 
5 translation start and stop sites in this gene. 

Immediately upstream of the ORF, regulatory sequences 
closely matching the -35, -10 and Shine-Dalgarno' 
sequences of E. coli , (Rosenberg, et al., Annul. Rev. 
Genet. . 13:319-353 (1979)) were identified. The 
10 spacing between these three regulatory motifs was also 
very well conserved. Although the other mycobacterial 
£3 promoters sequenced (Dale, et al., Molecular Biology of 

]i the Mycobacteria , chap. 8, 173-198 (1990)) show some 

Ly differences from the E. coli consensus sequences in all 

yi 15 the three regions -3 5, -10 and SD, the regulatory 

fy elements of pMBBSlA DNA showed a maximum degree of 

u sequence identity with E. coli in the -3 5 and SD 

u sequence elements with a single mismatch in each 

M element, and about 50% sequence identity in the Pribnow 

S ft 20 box. All the above features clearly indicated that 

Q this region is the promoter region for the 

mycobacterial gene contained in pMBBSlA . The extent of 
similarity between this BCG promoter sequence and a 
typical E. coli promoter is remarkable and explains the 
25 functional activity of this promoter, unlike many other 
mycobacterial promoters, in E. coli . The translation 
initiation codon in this ORF was ATG at position 508 
while a single translation termination codon TGA was 
identified at position 2790. Potential transcription 
3 0 termination structures capable of forming stem and loop 
conformations were identified in the region 3 • to this 
ORF. The pMBBSlA ORF thus represented a monocistronic 
gene rather than an operon. The promoter region of 
MBB51A gene is capable of directing gene expression in 
3 5 E. coli as well as in mycobacteria. This promoter 
sequence, is useful for directing expression of 




mycobacterial genes in E. coli . Further, this promoter 
sequence can also be used to express homologous and/or 
heterologous genes in a mycobacterium, thus providing 
a key element for the development of gene expression 
5 systems in mycobacteria. 

In order to derive information about the possible 
biological function of the MBB51A protein, the amino 
acid sequence of this protein was used to search for 
homology against available sequences in the PIR Protein 

10 Database Release 20 (Table I) and a Genebank Nucleic 
Acid Database (Table II) using the Fast A suite of 
programmes written by (Lipman and Pearson, Proc. Natl. 
Acad. Sci. . USA, 85:2 (1988)). The MBB51A protein 
sequence exhibited homology to a family of ion-motive 

15 ATPases from different organisms, ranging from bacteria 
to mammals. The 13 best scores from a search with 
ktuple 2 are shown in the upper panel of Table I and 10 
best scores from a search with ktuple 1 are shown in 
the lower panel. In each case, MBB51A protein 

20 exhibited maximum homology (75.9% homology in a 593 
amino acid overlap with 31.9% identity to a K+ 
transporting ATPase of S. f aecalis (Solioz et al . , 
1987) . The next best homology was observed with the B- 
chain of K+ transporting ATPase of E. coli (Hesse, et 

25 al., Proc. Natl. Acad. Sci. . U.S.A., 81:4746-4750 
(1984)) (68.8% homology in a 397 amino acid overlap 
with 24.2% identity). A lesser extent of homology was 
also seen with H+, Ca++ and Na+-ATPases from different 
organisms. The results of homology search thus 

3 0 indicated that MBB51A protein is an ion-motive ATPase 
of M. bovis BCG and is closely related to the other 
bacterial ion-motive ATPases. This is the first report 
of the cloning and identification of such an ATPase in 
mycobacteria. The BCG ion-motive ATPase showed 

35 homologies with other ion-motive ATPases with 
overlapping regions ranging in size from 593 amino 
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acids in case of S* f aecalis to 82 amino acids as in 
case of li. donovani , (Meade, et al., Mol . Cell Biol, , 
7, 3937-3946 (1987)), though most of the regions of 
sequence identity or conservation were localized in the 
5 C-terminal half of the MBB51A protein. Further , a 
region of 30 amino acids in the C-terminal half of 
MBB51A protein was found to be shared with most of 
these ATPases, thereby suggesting the functional 
importance of this region. Detailed alignment of 
10 MBB51A protein with the K+ ATPases of S, f aecalis and 
E. coli also indicated that several residues were 
conserved between the three ATPases, including the ones 
that are invariant in all ATPases from bacteria to man, 

TABLE I 



15 RESULTS OF HOMOLOGY SEARCH OF MBB51A 

AMINO ACID SEQUENCE AGAINST PIR PROTEIN DATABASE 



ktuple : 2 



LOCUS 


SHORT DEFINITION 


initn 


opt 


>A29576 


Potassium - transporting ATPase Streptococcus 


547 


792 


>PWECBK 


Potassium - transporting ATPase. B chain - E.coli 


314 


270 


>A25939 


Proton - transporting ATPase - Neurospora 


168 


186 


>A25623 


Proton - transporting ATPase - Yeast 


166 


184 


>PWRBFC 


Calcium^ - transporting ATPase, fast twitch skele 


152 


158 


>PWRBSC 


Calcium - transporting ATPase, slow twitch skele 


135 


157 


>A25344 


Potassium - transporting ATPase - Rat 


78 


155 


>RDEBHA 


Mercuric reductase -Shigella flexneri plasmid 


99 


142 


>RDPSHA 


Mercuric reductase (transposon TnSOl) 


74 


124 


>RGPSHA 


Mercuric resistance operon regulatory p 


79 


109 


>A24639 


Sodium/potassium- trans porting ATPase, alpha 


92 


. 82 


>A24414 


Sodium/potassium- transporting ATPase, alpha 


92 


82 


>B24862 


Sodium/potassium- transporting ATPase, beta 


83 * 


82 



The PIR protein data base (2378611 residues in 9124 sequences) was 
scanned with the FASTA program. The mean of the original initial 
score was 27.2 with a standard deviation of 6.9. Initial scores 
3 5 (initn) higher than 75.6 are 6 standard deviations above the 
average, a level of significance that usually indicates biological 
relatedness. Optimization (opt) generally will improve the 
initial score of related proteins by introducing gaps in the 
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sequence. Unrelated sequences usually do not have their scores 
improved by optimization. 



10 



ktuple : 1 

>A29576 potassium-transporting ATPase - Streptococcus 

>PWECBK potassium-transporting ATPase, 0 chain - Esche 

>A25939 Proton -transporting ATPase - Heuxospora crassa 

>A2S823 proton-transporting ATPase -Yeast ( Sac char oray ) 

>B24639 Sodium/pot as si urn- transporting ATPase, alpha < + 

>A24639 Sodium/potassium- transporting ATPase, alpha ch 

>C24639 Sodium/potassium- transporting ATPase .alpha (II 

>PWRBFC Calcium-transporting ATPase, fast twitch skele 

>PWSHNA Sodium/potassium- transporting ATPase, alpha skele 

>A24414 Sodium/potass iura- transporting ATPase, alpha chain 



744 


792 


366 


270 


310 


186 


317 


184 


158 


163 


175 


160 


192 


159 


240 


158 


214 


156 


214 


158 



TABLE II 



15 



20 



25 



30 



35 



RESULTS OF HOMOLOGY SEARCH OF MBB51A AMINO ACID SEQUENCE 
AGAINST GENBANK NUCLEIC ACID SEQUENCE DATABASE 



ktuple 



40 



LOCUS 


SHORT DEFINITION 


initn 


opt 


>STRATPK 


S.faecalis K+ ATPase, complete cds . 


537 


600 


>ECOKDPABC 


E.coli kdpABC operon coding for Kdp-ATpase 


314 


270 


>YSPPMA1A 


S.pombe H+ ATPase, complete cds. 


135 


188 


>NEUATPASE 


H. crassa plasma membrane ATPase, complete 


133 


186 


>NEUATPPM 


Neurospora crassa plasma membrane H+ ATPase 


131 


186 


>YSCPMA1 


Yeast PMA1 for plasma membrane ATPase 


166 


184 


>M17889 


Figure 2. N of L.donovani ATPase and 


166 


170 


>M12898 


Rabbit fast twitch skeletal muscle Ca++ ATPas 


140 


158 


>RABATPAC 


Rabbit Ca + Mg dependent Ca++ ATPase mRNA, co 


142 


157 


>NR1MER 


Plasmid NR1 mercury resistance (mer) operon. 


100 


143 


ktuple : 1 








>STRATPK 


S.faecalis K+ ATPase gene, complete cds. 


744 


600 


>SYNCATPSB 


Cyanobacterium Synechococcus 6301 DNA for AT 


379 


422 


>ECOKDPABC 


E.coli kdpABC operon coding for Kdp-ATPase p 


379 


270 


>YSPPMA1A 


S.pombe H+ ATPase gene, complete cds. 


275 


188 


>NEU ATPASE 


N. crassa plasma membrane ATPase gene, comple 


311 


186 


>NEUATPPM 


Neurospora crassa plasma membrane H+ ATPase 


302 


186 


>YSCPMA1 


Yeast PMA1 gene for plasma membrane ATPase 


317 


164 


>JO4004 


Leishmania donovani . cation transporting ATP 


322 


170 


>M17889 


Figure 2. Nucleotide seguence of L.donovani 


306 


170 


>RATATPA2 


Rat Na+,K+ ATPase alpha <+) isoform catalytic 


156 


163 




* * * 







The KdpB protein of E • coli and possibly the S. 
f aecalis K+ ATPase are members of ElE2-ATPases which 
are known to form an aspartyl phosphate intermediate, 
with cyclic transformation of the enzyme between 
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phosphorylated and dephosphorylated species. By analogy 
to other ATPases, the phosphorylated Asp residue (D) 
(Furst, et al., J. Biol. Chem. , 260:50-52 (1985)) was 
identified at position 44 3 in the MBB51A ATPase. This 



that has been conserved in ATPases from bacteria to 
man, and must form an essential element of the' 
catalytic site. Similarly, proline (P) at position 400 
in MBB51A ATPase was found to be an invariant amino 

10 acid in other ATPases and is predicted to be located in 
a membrane spanning domain. Such membrane buried 
proline residues have been hypothesized to be required 
for the reversible conformational changes necessary for 
the regulation of a transport channel (Brandl, et al., 

15 Proc. Natl. Acad. Sci. . U.S.A., 83:917-921 (1986)). In 
addition, other sequence motifs believed to be 
functionally important in other ion-motive ATPases were 
also found to be conserved in the MBB51A ATPase. These 
include a Gly (G) (Farley and Faller, J. Biol . Chem. , 

20 260:3899-3901 (1985)) at position 521 and Ala (A) 
(Ohta, et al., Proc. Natl. Acad. Sci. , U.S.A., 83:2071- 
2075 (1986)) at position 646, and are shown in 
Figure 5. 

Since the MBB51A ATPase was homologous to membrane 
25 associated ATPases, characterization of the membrane 
associated helices in MBB51A protein was performed by 
computer algorithms. Using a hydropathy profile (Rao, 
et al., Biochem. Bioohvs. Acta. , 869:197-214 (1986)), 
seven transmembrane domains in the MBB51A protein were 
3 0 identified and are shown in Table III and Figure 5. 
Nearly the same transmembrane domains were also 
identified using the hydrophobic moment plot (Eisenberg 
et al., J. Mol. Biol. , 179:125-142 (1984)) and are also 
shown in Table III and Figure 5. The average size of 
3 5 a transmembrane domain is around 21 residues, because 
21 residues coil into an a-helix approximately the 



5 



residue is the first of a pentapeptide sequence DKTGT 
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thickness of the apolar position of a lipid bilayer (32 
A) . This size of a transmembrane domain is, however, 
flexible within the range of a few amino acids , as 
determined by the functional properties of a given 
5 membrane-associated protein. The transmembrane domains 
identified in MBB51A protein, range in size from 20-37 
residues. The first six transmembrane domains span the 
membrane only once, as indicated by both the hydropathy 
profile and the hydrophobic moment plot. The seventh 

10 transmembrane domain may traverse the membrane twice. 
These features along with the membrane buried proline 
(P) at position 4 00, are in accordance with the channel 
transport functions of ion-motive ATPases, involving a 
reversible change in the conformation of these 

15 proteins. Such transmembrane domains further define 
the intracellular and extracellular domains of this 
molecule. See Figure 5. 







Table III 








Transmembrane 


Eisenberg 


Rao & 


Argos 


20 


Domain in Fig. 5 


Method 


Method 




1 


102 - 122 


98 


- 125 




2 


129 - 149 


127 


- 147 




3 


164 - 184 


164 


- 185 




4 


199 - 219 


198 


- 220 


25 


5* 


361 - 381 


360 


- 382 




6 


387 - 407 


387 


- 419 




7 


703 - 723 


695 


- 732 



The hydropathy profile of MBB51A protein was nearly 
super imposable over that of S. faecalis K+ ATPase, even 
30 though the MBB51A ATPase has at the N-terminus, 154 
extra amino acids, which were absent in S. faecalis. 
This clearly puts in evidence the strong evolutionary 
conservation of the broad domain structure between 
these two proteins, making it more likely for the two 
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proteins to have a similar three dimensional structural 
organization. 

Based on the hydropathy profile and secondary structure 
predictions, a schematic model of the MBB51 A ATPase is 
5 presented in Figure 5. This model comprises at least 
seven transmembrane domains which span the membrane, 
once are indicated along with the respective amino acid 
positions in Figure 5* This model further defines 
extracellular and intracellular domains of the MBB51A 

10 protein* Many of the residues which have been shown to 
be functionally important in other ion-motive ATPases 
and are also conserved in the MBB51A protein, are also 
shown. Of these, proline (P) at position 400 is 
membrane-buried whereas as aspartic acid(D) at 443, 

15 glycine (G) at 521 and alanine (A) at 646, face the 
cytoplasm* 



In order to determine whether the gene encoding MBB51A 
ion-motive ATPase is present in other mycobacterial 
strains related or unrelated to BCG, like the virulent 
20 strain M. tuberculosis H37Rv and other non-tuberculous, 
P non-pathogenic mycobacteria like M . vaccae and M. 
\ smeamatis Southern blot hybridization with genomic DNA 
\ from the above species was performed, using as probe 
\ BCG insert DNA from pMB651A- As shown in Figure 6, DNA 
25 hybridizable with the pMBB51A insert' DNA was also 
present in M . tuberculosis H3 7Rv DNA but not in £1 . 
I smeamatis and M. vaccae . This indicated that the M. 
tuberculosis H37Rv homologue of the pMBBSlA gene has a 
similar genetic organization as seen in M, bovis BCG 
3 0 DNA, and is present on a 3*25 kb BamH I fragment* 

The availability of novel Mycobacterium bovis BCG 
and/or Mycobacterium tuberculosis H37Rv antigens make 
it possible to address basic biochemical, 
immunological, diagnostic and therapeutic questions 
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still unanswered about tuberculosis and Mycobacterium 
tuberculosis . For example, Mycobacterium tuberculosis 
specific antigenic determinants can be used to develop 
simple and specific seroepidemiological tests to screen 
human populations. Such serological tests are highly 
specific because of the use of antigenic determinants 
determined by the approaches described above and known' 
to be unique to Mycobacterium tuberculosis H37Rv. Such 
serological tests are useful for early diagnosis of 
tuberculosis, thus permitting early treatment and 
limiting transmission of the disease from infected 
individuals to others. 

Resistance to tuberculosis is provided by cell mediated 
immunity. The antigens identified here can be further 
used to determine which segments of these antigens are 
recognized by Mycobacterium tuberculosis specific T— 
cells. A mixture of peptides recognized by helper T- 
cells provides a specific skin test antigen for use in 
assessing the immunological status of patients and 
their contacts. A mixture of such peptides is also 
useful in evaluating rapidly the immunological efficacy 
of candidate vaccines. In addition peptides recognized 
by Mycobacterium tuberculosis specific T-cells can be 
components of a vaccine against the disease. 

Knowledge of the complete nucleotide sequence of 
pMBBSIA DNA insert provides a rich source of sequence 
information which can be used to design appropriate 
primers for PCR amplification of mycobacterial genomic 
DNA fragments. The ion-motive ATPase of BCG has areas 
of heavily conserved sequences (for, e.g., the ATP 
binding site) which are expected to be the same for all 
mycobacterial species and areas of sequence divergence 
(for, e.g., the N-terminal region) which are different 
in different mycobacterial species. Based on this 
knowledge primers can be designed either from the 
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conserved regions or from the diverged regions to 
identify whether in a given sample the target DNA is 
mycobacterial versus non-mycobacterial , and in case of 
mycobacterial DNA, which mycobacterial species the DNA 
5 belongs. 

Such amplification schemes are useful for the 
development of highly sensitive and specific PGR 
amplification based diagnostic procedures for 
mycobacteria. The observation that the 3.25kb pMBBSIA 

10 DNA insert is present in Mycobacterium tuberculosis 
H37Rv and Mycobacterium bovis BCG and is absent in 
avirulent Mycobacterium vaccae and Mycobacterium 
smegmatis , which have bearing on other aspects of the 
biological differences between these species, manifest 

15 in terms of virulence, growth characteristics and 
metabolism. 

Recombinant vaccines can also be constructed by 
incorporating the DNA encoding all or part of the 
membrane-associated polypeptides of the invention into 

20 an appropriate vaccine vehicle. For example, all or 
part of the DNA encoding the 79kD Mycobacterium bovis 
BCG protein or a portion of the protein can be 
incorporated into a vaccine vehicle capable of 
expressing the said DNA. Such a vaccine vehicle could 

25 be a virus for, e.g., vaccinia virus, etc., or a 
bacterium, e.g., mycobacteria, Salmonella, Vibrio, 
Bacillus, Yersinia, Bordetella, etc. to produce a 
vaccine capable of conferring long-lasting immunity on 
individuals to whom it is administered. 

30 A special feature of the 79kD BCG ion-motive ATPase is 
that it is a membrane bound antigen. Therefore, it can 
be used to link foreign DNA sequences encoding 
antigenic epitopes (B-cell epitopes or T-cell epitopes) 
of interest, with this gene or a portion of this gene 
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in a manner which causes the foreign epitope to be used 
as an immunogen. Such linkages can be engineered into 
extracellular or intracellular domains of MBB51A 
protein, or into a combination of both types of 
domains. Engineering of immunogenic foreign epitopes 
into MBB51A DNA is accomplished by standard recombinant 
DNA methods known to those skilled in the art. Some of, 
these methods involve use of unique restriction sites, 
in vitro mutagenesis and/or PCR-related methods. One 
such convenient method involves the use of a unique 
Ndel site at position 1090 in the MBB51A DNA where 
foreign DNA can be inserted. Grafting of epitopes on 
the cell surface induces rapid antibody response by 
virtue of the epitope being well-exposed on the 
bacterial cell, which in turn leads to direct 
activation of B cells. In addition,, intracellular 
localization of an epitope induces B cell memory and a 
proficient T cell response. Examples of epitopes of 
interest known to be involved in the immune response to 
various pathogens include epitopes from E. coli LT 
toxin, foot and mouth disease virus, HIV, cholera 
toxin, etc. 

Thus, the 79 kD antigen is useful in the design of 
recombinant vaccines against different pathogens. Such 
vaccines comprise a recombinant vaccine vehicle capable 
of expressing all or part of the 79 kD membrane- 
associated protein of mycobacteria, into which foreign 
epitopes have been engineered, such that the foreign 
epitopes are expressed on the outer surface and/or on 
the inner side of the cell membrane, thereby rendering 
the foreign epitopes immunogenic. The vaccine vehicle 
for this purpose may be a cultivable mycobacterium for, 
e.g., BCG. In these applications, the BCG ion-motive 
ATPase gene can be borne on a mycobacterial shuttle 
vector or alternately the foreign DNA encoding 
antigenic epitopes of the immunogenic polypeptides can 
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be inserted into the mycobacterial genome via 
homologous recombination in the ion-motive ATPase gene 
or random integration. Such a process yields stable 
recombinant mycobacterial strains capable of expressing 
5 on their surface and/or in the cytoplasm antigenic 
sequences of interest, which can, for example, provide 
protection against a variety of infectious pathogens., 
Targeting of recombinant antigens to the cell-wall is 
attractive not only because of the high immunogenicity 

10 of mycobacterial cell-walls but, in addition, because 
of concerns with the introduction of a live vaccine in 
populations with a high prevalence of HIV 
seropositivity . Additionally, based on the MBB51A 
protein, a non-living but immunogenic recombinant cell 

15 surface subunit vaccine can also be developed to 
provide a useful alternative to live vaccines. 
Alternately, other bacterial, viral or protozoan 
vaccine vehicles could be transformed to generate such 
recombinant vaccines. Examples of potential vaccine 

20 vehicles include vaccinia virus, pox-viruses, 
Salmonella, Yerisinia, Vibrio, Bordetella, Bacillus, 
etc. 

Further, * using such an approach, multivalent 
recombinant vaccines which allow simultaneous 
25 expression of multiple protective epitopes/antigens of 
different pathogens, could also be designed. 

Equivalents 

Those skilled in the art will recognize, or be able to 
ascertain, using no more than routine experimentation, 
3 0 many equivalents to the specific materials and 
components described specifically herein. Such 
equivalents are intended to be encompassed in the scope 
of the following claims. 
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(B) FILING DATE: 29-JUL-l\92 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Trecartin, Richard 

(B) REGISTRATION NUMBER: 31,801 

(C) REFERENCE/DOCKET NUMBER: A-\7004/RFT 

(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (415) 781-1989 

(B) TELEFAX: (415) 398-3249 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3250 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 508.. 2790 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGATCCCGCG GTCATCGATC GGGTCAAACA CCGCCTCGAC GGGTTCACGC TGGCGCCGCT 60 

GTCCACCGCC GCGGGAGGTG GTGGCCGGCA GCCACGCATC TACTACGGCA CCATCCTGAC 120 

CGGTGACCAA TACCTTCACT GCGAGCGCAC CCGCAACCGG CTGCACCACG AACTCGGCGG 180 

TATGGCCGTC GAAATGGAAG GCGGTGCGGT GGCGCAAATC TGCGCGTCCT TCGATATCCC 240 

ATGGCTGGTC ATTCGCGCGC TCTCCGATCT CGCCGGAGCC GATTCGGGGG TGGACTTCAA 300 

TCGGTTTGTC GGCGAGGTGG CGGCCAGTTC GGCCCGCGTT CTGCTGCGCT TGCTGCCGGT 360 

GTTGACGGCC TGTTGAAGAC GACTATCCGC CGGTGCGTTC ACCGCGTCAG GCGGCTTCGG 420 

TGAGGTGAGT AATTTGGTCA TTAACTTGGT CATGCCGCCG CCGATGTTGA GCGGAGGCCA 480 

CAGGTCGGCC GGAAGTGAGG AGCCACG ATG ACG GCG GCC GTG ACC GGT GAA 531 

Met Thr Ala Ala Val Thr Gly Glu 
1 5 

CAC CAC GCG AGT GTG CAG CGG ATA CAA CTC AGA ATC AGC GGG ATG TCG 579 
His His Ala Ser Val Gin Arg lie Gin Leu Arg lie Ser Gly Met Ser 
10 15 20 

TGC TCT GCG TGC GCC CAC CGT GTG GAA TCG ACC CTC AAC AAG CTG CCG 627 
Cys Ser Ala Cys Ala His Arg Val Glu Ser Thr Leu Asn Lys Leu Pro 
25 30 35 AO 

GGG GTT CGG GCA GCT GTG AAC TTC GGC ACC CGG GTG GCA ACC ATC GAC 675 
Gly Val Arg Ala Ala Val Asn Phe Gly Thr Arg Val Ala Thr lie Asp 
45 50 55 . 

ACC AGC GAG GCG GTC *GAC GCT GCC GCG CTG TGC CAG GCG GTC CGC CGC 723 
Thr Ser Glu Ala Val Asp Ala Ala Ala Leu Cys Gin Ala Val Arg Arg 
60 65. 70 

GCG GGC TAT CAG GCC GAT CTG TGC ACG GAT GAC GGT CGG AGC GCG AGT 771 
Ala Gly Tyr Gin Ala Asp Leu Cys Thr Asp Asp Gly Arg Ser Ala Ser 
75 80 85 

GAT CCG GAC GCC GAC CAC GCT CGA CAG CTG CTG ATC CGG CTA GCG ATC 819 
Asp Pro Asp Ala Asp His Ala Arg Gin Leu Leu He Arg Leu Ala He 
90 95 100 

GCC GCC GTG CTG TTT GTG CCC GTG GCC GAT CTG TCG GTG ATG TTT GGG 867 
Ala Ala Val Leu Phe Val Pro Val Ala Asp Leu Ser Val Met Phe Gly 
105 110 115 120 

GTC GTG CCT GCC ACG CGC TTC ACC GGC TGG CAG TGG GTG CTA AGC GCG 915 
Val Val Pro Ala Thr Arg Phe Thr Gly Trp Gin Trp Val Leu Ser Ala 
125 130 135 



i 



* 
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CTG GCA CTG CCG GTC GTG ACC TGG GCG GCG TGG CCG TTT CAC CGC GTT 
Leu Ala Leu Pro Val Val Thr Trp Ala Ala Trp Pro Phe His Arg Val 
140 145 150 

GCG ATG CGC AAC GCC CGC CAC CAC GCC GCC TCC ATG GAG ACG CTA ATC 
Ala Met Arg Asn Ala Arg His His Ala Ala Ser Met Glu Thr Leu He 
155 160 165 

TCG GTC GGT ATC ACG GCC GCC ACG ATC TGG TCG CTG TAG ACC GTC TTC 
Ser Val Gly He Thr Ala Ala Thr He Trp Ser Leu Tyr Thr Val Phe 
170 175 180 

GGC AAT CAC TCG CCC ATC GAG CGC AGC GGC ATA TGG CAG GCG CTG CTG 
Gly Asn His Ser Pro He Glu Arg Ser Gly He Trp Gin Ala Leu Leu 
185 190 195 200 

GGA AGC GAT GCT ATT TAT TTC GAG GTC GCG GCG GGT GTC ACG GTG TTC 
Gly Ser Asp Ala He Tyr Phe Glu Val Ala Ala Gly Val Thr Val Phe 
205 210 215 

GTG CTG GTG GGG CGG TAT TTC GAG GCG CGC GCC AAG TCG CAG GCG GGC 
Val Leu Val Gly Arg Tyr Phe Glu Ala Arg Ala Lys Ser Gin Ala Gly 
220 225 230 

AGT GCG CTG AGA GCC TTG GCG GCG CTG AGC GCC AAG GAA GTA GCC GTC 
Ser Ala Leu Arg Ala Leu Ala Ala Leu Ser Ala Lys Glu Val Ala Val 
235 240 245 

CTG CTA CCG GAT GGG TCG GAG ATG GTC ATC CCG GCC GAC GAA CTC AAA 
Leu Leu Pro Asp Gly Ser Glu Met Val He Pro Ala Asp Glu Leu Lys 
250 255 260 

GAA CAG CAG CGC TTC GTG GTG CGT CCA GGG CAG ATA GTT GCC GCC GAC 
Glu Gin Gin Arg Phe Val Val Arg Pro Gly Gin He Val Ala Ala Asp 
265 270 275 280 

GGC CTC GCC GTC GAC GGG TCC GCT GCG GTC GAC ATG AGC GCG ATG ACC 
Gly Leu Ala Val Asp Gly Ser Ala Ala Val Asp Met Ser Ala Met Thr 
285 290 295 

GGC GAG GCC AAA CCG ACC CGG GTG CGT CCG GGG GGG CAG GTC ATC GGC 
Gly Glu Ala Lys Pro Thr Arg Val Arg Pro Gly Gly Gin Val He Gly 
300 305 310 

GGC ACC ACA GTG CTT GAC GGC CGG CTG ATC GTG GAG GCG GCC GCG GTG 
Gly Thr Thr Val Leu Asp Gly Arg Leu He Val Glu Ala Ala Ala Val 
315 320 325 

GGC GCC GAC ACC CAG TTC GCC GGA ATG GTC CGC CTC GTT GAG CAA GCG 
Gly Ala Asp Thr Gin Phe Ala Gly Met Val Arg Leu Val Glu Gin Ala 
330 335 340 

CAG GCG CAA AAG GCC GAC GCA CAG CGA CTA GCC GAC CGG ATC TCC TCG 
Gin Ala Gin Lys Ala Asp Ala Gin Arg Leu Ala Asp Arg He Ser Ser 
345 350 355 360 





V. 
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GTG TTT GTT CCC GCT GTG TTG GTT ATC GCG GCA CTA ACC GCA GCC GGA 1635 
Val Phe Val Pro Ala Val Leu Val lie Ala Ala Leu Thr Ala Ala Gly 
365 370 375 

TGG CTA ATC GCC GGG GGA CAA CCC GAC CGT GCC GTC TCG GCC GCA CTC 1683 
Trp Leu lie Ala Gly Gly Gin Pro Asp Arg Ala Val Ser Ala Ala Leu 
380 385 390 

GCC GTG CTT GTC ATC GCC TGC CCG TGT GCC CTG GGG CTG GCG ACT CCG 1731 
Ala Val Leu Val lie Ala Cys Pro Cys Ala Leu Gly Leu Ala Thr Pro 
395 400 405 

ACC GCG ATG ATG GTG GCC TCT GGT CGC GGT GCC CAG CTC GGA ATA TTT 1779 
Thr Ala Met Met Val Ala Ser Gly Arg Gly Ala Gin Leu Gly lie Phe 
410 415 420 

CTG AAG GGC TAC AAA TCG TTG GAG GCC ACC CGC GCG GTG GAC ACC GTC 1827 
Leu Lys Gly Tyr Lys Ser Leu Glu Ala Thr Arg Ala Val Asp Thr Val 
425 430 435 440 

GTC TTC GAC AAG ACC GGC ACC CTG ACG ACG GGC CGG CTG CAG GTC AGT 1875 
Val Phe Asp Lys Thr Gly Thr Leu Thr Thr Gly Arg Leu Gin Val Ser 
445 450 455 

GCG GTG ACC GCG GCA CCG GGC TGG GAG GCC GAC CAG GTG CTC GCC TTG 1923 
Ala Val Thr Ala Ala Pro Gly Trp Glu Ala Asp Gin Val Leu Ala Leu 
460 465 470 

GCC GCG ACC GTG GAA GCC GCG TCC GAG CAC TCG GTG GCG CTC GCG ATC 1971 
Ala Ala Thr Val Glu Ala Ala Ser Glu His Ser Val Ala Leu Ala lie 
475 480 485 

GCC GCG GCA ACG ACT CGG CGA GAC GCG GTC ACC GAC TTT CGC GCC ATA 2019 
Ala Ala Ala Thr Thr Arg Arg Asp Ala Val Thr Asp Phe Arg Ala lie 
490 495 500 

4 

CCC GGC CGC GGC GTC AGC GGC ACC GTG TCC GGG CGG GCG GTA CGG _ GTG 2067 

Pro Gly Arg Gly Val Ser Gly Thr Val. Ser Gly Arg Ala Val Arg Val 

505 510 515 520 

GGC AAA CCG TCA TGG ATC GGG TCC TCG TCG TGC CAC CCC AAC ATG CGC 2115 
Gly Lys Pro Ser Trp lie Gly Ser Ser Ser Cys His Pro Asn Met Arg 
525 530 535 

GCG GCC CGG CGC CAC GCC GAA TCG CTG GGT GAG ACG GCC GTA TTC GTC 2163 
Ala Ala Arg Arg His Ala Glu Ser Leu Gly Glu Thr Ala Val Phe Val 
540 545 550 

GAG GTC GAC GGC GAA CCA TGC GGG GTC ATC GCG GTC GCC GAC GCC GTC 2211 
Glu Val Asp Gly Glu Pro Cys Gly Val lie Ala Val Ala Asp Ala Val 
555 560 565 

AAG GAC TCG GCG CGA GAC GCC GTG GCC GCC CTG GCC GAT CGT GGT CTG 2259 
Lys Asp Ser Ala Arg Asp Ala Val Ala Ala Leu Ala Asp Arg Gly Leu 
570 575 580 



• c • 
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CGC ACC ATG CTG TTG ACC GGT GAC AAT CCC GAA TCG GCG GCG GCC GTG 2307 

Arg Thr Met Leu Leu Thr Gly Asp Asn Pro Glu Ser Ala Ala Ala Val 
585 590 595 600 

GCT ACT CGC GTC GGC ATC GAC GAG GTG ATC GCC GAC ATC CTG CCG GAA 2355 
Ala Thr Arg Val Gly He Asp Glu Val He Ala Asp He Leu Pro Glu 
605 610 615 

GGC AAG GTC GAT GTC ATC GAG CAG CTA CGC GAC CGC GGA CAT GTC GTC 2403 
Gly Lys Val Asp Val He Glu Gin Leu Arg Asp Arg Gly His Val Val 
620 625 630 

GCC ATG GTC GGT GAC GGC ATC AAC GAC GGA CCC GCA CTG GCC CGT GCC 2451 
Ala Met Val Gly Asp Gly He Asn Asp Gly Pro Ala Leu Ala Arg Ala 
635 640 645 

GAT CTA GGC ATG GCC ATC GGG CGC GGC ACG GAC GTC GCG ATC GGT GCC 2499 
Asp Leu Gly Met Ala He Gly Arg Gly Thr Asp Val Ala He Gly Ala 
650 655 660 

GCC GAC ATC ATC TTG GTC CGC GAC CAC CTC GAC GTT GTA CCC CTT GCG 2547 
Ala Asp He He Leu Val Arg Asp His Leu Asp Val Val Pro Leu Ala 
665 670 675 680 

CTT GAC CTG GCA AGG GCC ACG ATG CGC ACC GTC AAA CTC AAC ATG GTC 2595 
Leu Asp Leu Ala Arg Ala Thr Met Arg Thr Val Lys Leu Asn Met Val 
685 690 695 

TGG GCA TTC GGA TAC AAC ATC GCC GCG ATT CCC GTC GCC GCT GCC GGA 2643 
Trp Ala Phe Gly Tyr Asn He Ala Ala He Pro Val Ala Ala Ala Gly 
700 705 710 

CTG CTC AAC CCC CTG GTG GCC GGT GCG GCC ATG GCG TTC TCA TCG TTC 2691 
Leu Leu Asn Pro Leu Val Ala Gly Ala Ala Met Ala Phe Ser Ser Phe 
715 720 725 

TTC GTG GTC TCA AAC AGC TTG CGG TTG CGC AAA TTT GGG CGA TAC CCG 2739 
Phe Val Val Ser Asn Ser Leu Arg Leu Arg Lys Phe Gly Arg Tyr Pro 
730 735 740 

CTA GGC TGC GGA ACC GTC GGT GGG CCA CAA ATG ACC GCG CCG TCG TCC 2787 
Leu Gly Cys Gly Thr Val Gly Gly Pro Gin Met Thr Ala Pro Ser Ser 
745 750 755 760 

GCG TGATGCGTTG TCGGGCAACA CGATATCGGG CTCAGCGGCG ACCGCATCCG 2840 
Ala 



GTCTCGGCCG AGGACCAGAG GCGCTTCGCC ACACCATGAT TGCCAGGACC GCG CCG ATC A 2900 

CCACCGGCAG ATGAGTCAAA ATCCGCGTGG TGCTGACCGC GCCGGACAGC GCATCCACAA 2960 

TCACATAGCC GGTCAGTATG GCGACGAACG CCGTCAGAAC ACCGGCCAGG CCGGCGGCGG 3020 

CGCTCGGCCA TAGCGCCGCG CCCACCATGA TCACACCGAG CGCAATCGAC CACGACGTGA 3080 
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CTCGTTGAGC AAGTGGGTGC CGGCACCCGT CGGGTGCTGA TGGGTCAGGC CGACGTCTAG 
GCCAAACCCC TGCACGGTGC CCAGGGCGAT CTGCGCGATG CCCACGCACA GCAACGCCCA 
ACGTCGCCAG GTCATCGGTG AATGTTGCCG CCGCGGCGCC CGGCGGATCC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 761 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Ala Ala Val Thr Gly Glu His His Ala Ser Val Gin Arg He 
15 10 15 

Gin Leu Arg He Ser Gly Met Ser Cys Ser Ala Cys Ala His Arg Val 
20 25 30 

Glu Ser Thr Leu Asn Lys Leu Pro Gly Val Arg Ala Ala Val Asn Phe 
35 40 45 

Gly Thr Arg Val Ala Thr He Asp Thr Ser Glu Ala Val Asp Ala Ala 
50 55 60 

Ala Leu Cys Gin Ala Val Arg Arg Ala Gly Tyr Gin Ala Asp Leu Cys 
65 70 75 80 

Thr Asp Asp Gly Arg Ser Ala Ser Asp Pro Asp Ala Asp His Ala Arg. 

85 90 95 

* 

Gin Leu Leu He Arg Leu Ala He Ala Ala Val Leu Phe Val Pro Val 
100 105 HO 

Ala Asp Leu Ser Val Met Phe Gly Val Val Pro Ala Thr Arg Phe Thr 
115 120 125 

Gly Trp Gin Trp Val Leu Ser Ala Leu Ala Leu Pro Val Val Thr Trp 
130 135 140 

Ala Ala Trp Pro Phe His Arg Val Ala Met Arg Asn Ala Arg His His 
145 150 155 160 

Ala Ala Ser Met Glu Thr Leu He Ser Val Gly He Thr Ala Ala Thr 
165 170 175 

He Trp Ser Leu Tyr Thr Val Phe Gly Asn His Ser Pro He Glu Arg 
180 185 190 

Ser Gly He Trp Gin Ala Leu Leu Gly Ser Asp Ala He Tyr Phe Glu 
s 195 200 205 



* 
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Val Ala Ala Gly Val Thr Val Phe Val Leu Val Gly Arg Tyr Phe Glu 
210 215 220 

Ala Arg Ala Lys Ser Gin Ala Gly Ser Ala Leu Arg Ala Leu Ala Ala 
225 230 235 240 

Leu Ser Ala Lys Glu Val Ala Val Leu Leu Pro Asp Gly Ser Glu Met 
245 250 255 

Val lie Pro Ala Asp Glu Leu Lys Glu Gin Gin Arg Phe Val Val Arg 
260 265 270 

Pro Gly Gin lie Val Ala Ala Asp Gly Leu Ala Val Asp Gly Ser Ala 
275 280 285 

Ala Val Asp Met Ser Ala Met Thr Gly Glu Ala Lys Pro Thr Arg Val 
290 295 300 

Arg Pro Gly Gly Gin Val lie Gly Gly Thr Thr Val Leu Asp Gly Arg 
305 310 315 320 

Leu lie Val Glu Ala Ala Ala Val Gly Ala Asp Thr Gin Phe Ala Gly 
325 330 335 

Met Val Arg Leu Val Glu Gin Ala Gin Ala Gin Lys Ala Asp Ala Gin 
340 345 350 

Arg Leu Ala Asp Arg lie Ser Ser Val Phe Val Pro Ala Val Leu Val 
355 360 365 

He Ala Ala Leu Thr Ala Ala Gly Trp Leu He Ala Gly Gly Gin Pro 
370 375 380 

Asp Arg Ala Val Ser Ala Ala Leu Ala Val Leu Val He Ala Cys Pro 
385 390 395 400 

Cys Ala Leu Gly Leu Ala Thr Pro Thr Ala Met Met Val Ala Ser Gly 
405 410 415 

Arg Gly Ala Gin Leu Gly He Phe Leu Lys Gly Tyr Lys Ser Leu Glu 
420 425 430 

Ala Thr Arg Ala Val Asp Thr Val Val Phe Asp Lys Thr Gly Thr Leu 
435 440 445 

Thr Thr Gly Arg Leu Gin Val Ser Ala Val Thr Ala Ala Pro Gly Trp 
450 455 460 

Glu Ala Asp Gin Val Leu Ala Leu Ala Ala Thr Val Glu Ala Ala Ser 
465 470 475 480 



Glu His Ser Val Ala Leu Ala He Ala Ala Ala Thr Thr Arg Arg Asp 
485 490 495 



Ala Val Thr Asp Phe Arg Ala He Pro Gly Arg Gly Val Ser Gly Thr 
500 505 510 
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Val Ser GlyVArg Ala Val Arg Val Gly Lys Pro Ser Trp lie Gly Ser 
515 \ 520 525 

Ser Ser Cys Hr« Pro Asn Met Arg Ala Ala Arg Arg His Ala Glu Ser 
530 \ 535 540 

Leu Gly Glu Thr Aa Val Phe Val Glu Val Asp Gly Glu Pro Cys Gly 
545 \ 550 555 560 



V 



Val He Ala Val Ala Asp Ala Val Lys Asp Ser Ala Arg Asp Ala Val 
565 \ 570 575 

Ala Ala Leu Ala Asp Arg \ly Leu Arg Thr Met Leu Leu Thr Gly Asp 
580 \ 585 590 

Asn Pro Glu Ser Ala Ala AlaVtfal Ala Thr Arg Val Gly He Asp Glu 
595 500 605 

Val He Ala Asp He Leu Pro Gl\i Gly Lys Val Asp Val He Glu Gin 
610 615 \ 620 

Leu Arg Asp Arg Gly His Val Val A%a Met Val Gly Asp Gly He Asn 
625 630 \ 635 640 

Asp Gly Pro Ala Leu Ala Arg Ala Asp fteu Gly Met Ala He Gly Arg 
645 650 655 

Gly Thr Asp Val Ala He Gly Ala Ala Asp \le He Leu Val Arg Asp 
660 665 \ 670 

His Leu Asp Val Val Pro Leu Ala Leu Asp LeuV^la Arg Ala Thr Met 
675 680 \ 685 

Arg Thr Val Lys Leu Asn Met Val Trp Ala Phe Gl^ Tyr Asn He Ala 
690 695 

Ala He Pro Val Ala Ala Ala Gly Leu Leu Asn Pro Leu Val Ala Gly 
705 710 715 \ - ~ 720 

Ala Ala Met Ala Phe Ser Ser Phe Phe Val Val Ser Asn ^er Leu Arg 
725 730 \ 735 

Leu Arg Lys Phe Gly Arg Tyr Pro Leu Gly Cys Gly Thr Val\Gly Gly 
740 745 750 



Pro Gin Met Thr Ala Pro Ser Ser Ala 
755 760 



